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DAVID C. SCHMITTLEIN AND ROBERT A. PETERSON 
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Customer base analysis is concerned with using the observed past purchase behavior of customers 
to understand their current and likely future purchase patterns. More specifically, as developed 
in Schmittlein et al. (1987), customer base analysis uses data on the frequency, timing, and dollar 
value of each customer’s past purchases to infer 

e the number of customers currently active, 

how that number has changed over time, 

which individual customers are most likely still active, 

how much longer each is likely to remain an active customer, and 

how many purchases can be expected from each during any future time period of interest. 

In this paper we empirically validate the model proposed by Schmittlein et al. In doing so, we 
provide one of the few applications of stochastic models to industrial purchase processes and 
industrial marketing decisions. Besides showing that the model does capture key aspects of the 
purchase process, we also present a more effective parameter estimation method and some results 
regarding sampling properties of the parameter estimates. Finally, we extend the model to explicitly 
incorporate dollar volume of past purchases. 

Our results indicate that this kind of customer base analysis can be both effective in predicting 
purchase patterns and in generating insights into how key customer groups differ. The link of 
both these benefits to industrial marketing decision making is also discussed. 

(Estimation and Other Statistical Techniques; Industrial Marketing; Measurement; Promotion) 


1. Introduction 


To judge from the published literature, it would be easy to conclude that detailed 
probabilistic analysis of customer purchase patterns in industrial markets is ineffective 
and/or irrelevant. This is not to say that no such examples exist (a counterexample is 
Uncles and Ehrenberg’s (1990) study of aviation fuel contracts), but they are virtually 
nonexistent in the overviews provided by Ehrenberg (1988), Greene (1982), Lilien et 
al. (1992), and Massy et al. (1970). 

The stereotypes of industrial markets suggest several reasons for the absence of such 
analyses. (For a summary of stylized distinctions between industrial and consumer markets 
see Webster and Wind (1977) or Jackson and Cooper (1988).) One is the lack, in some 
industrial markets (e.g., nuclear power plant construction), of any appreciable degree of 
repeat buying over the relevant time horizon. Such a lack of repurchase behavior does 
not, however, characterize all industrial markets. Indeed, the notion of “relationship 


41 


0732-2399/94/1301/0041$01.25 
Copyright © 1994, The Institute of Management Sciences/Operations Research Soctety of Amenca 


Copyright © 2001 All Rights Reserved 


Downloaded from informs.org by [137.132.123.69] on 02 November 2015, at 06:14 . For personal use only, all rights reserved. 


RIGHTS 


42 DAVID C. SCHMITTLEIN AND ROBERT A. PETERSON 


marketing” (Kotler 1991, pp. 8, 205) is based entirely on the importance of repeat 
buying. 

Another plausible reason is the generally smaller number of customers in industrial 
markets compared to consumer markets. Formal models may provide little value if the 
relative handful of key customers can simply be telephoned. But this again does not 
characterize all industrial marketing. Having (say ) 20,000 business customers ofa potential 
market of 80,000 firms is certainly smaller than 20 million consumer households out of 
80 million in total. But a summary of past and projected customer activity cannot simply 
stem from speaking personally to each customer, and the salesforce may lack the will- 
ingness or ability to undertake such individual analyses. 

A third possible explanation is the supposed paucity of data on industrial product 
customer purchases (relative to supermarket packaged goods). Like the two preceding 
explanations, this one also contains a grain of truth, but does not apply universally. 
Indeed, many industrial marketers have a wealth of information detailing past purchases 
by their customer base (even more detailed information about individual customers than 
retail marketers). What these firms lack, relative to consumer markets, is (sometimes) 
systematic information about what other firms were offering at the same time, and (usu- 
ally) systematic information about customer purchases of those other firms’ goods/ser- 
vices. Thus they do not have the complete purchase record (and “store environment” 
information ) for a set of households of the type available in consumer nondurables from 
market research firms such as A. C. Nielsen or Information Resources, Inc. 

This data limitation does inhibit some probabilistic modeling applications. In fact, it 
rules out the logit-type analyses of customer product choice that have become so popular 
for consumer packaged goods (e.g., Gaudagni and Little 1983; Lattin and McAlister 
1985; Zufryden 1986). On the other hand, industrial marketers have an advantage in 
that their customer base typically represents a census of their customers, rather than the 
sample employed by consumer goods firms. This is a major benefit in using individual- 
level purchase models for direct marketing/ targeting decisions. In this respect they re- 
semble a variety of consumer goods businesses having such purchase information for 
their entire customer base, e.g., many service businesses (medical offices, brokerage firms) 
and sellers of nonsupermarket goods (department stores for charge card purchases, mag- 
azine publishers). Besides this similarity in customer information available, these firms 
share another important quality, namely the ability to benefit from answers to questions 
like: 

(1) How many active customers does the firm now have, and has the number been 
growing or declining? 

(2) Which individual customers are most likely to represent active customers? Inactive 
customers? 
and probably most important 

(3) What level of transactions should be expected next period (e.g., year) by those on 
the customer list, both individually and collectively? 

The answers to these questions have implications for very long-range decisions (valuing 
the business via the future cash stream of its current customers), intermediate-range 
actions (how much should be spent to attract new customers and/or retain current 
customers, i.e., what is the long-term value of a new/current customer ), and short-range 
tactics (which customers should be sent direct marketing incentives to attempt to induce 
immediate purchases). As indicated previously in Schmittlein et al. (1987, hereafter 
SMC), these questions are not answered well by aggregate sales data or trends alone. 

The model proposed in SMC provides answers to the three questions above for any 
firms maintaining detailed purchase information (purchase frequency and timing) for 
their own customer base. For a new industrial customer after initial trial, it envisions a 
repeat purchase process having the following major features: 

(1) Some customers become inactive, i.e., deterministically no longer purchase, at 
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various points in time. To an observer, the time for which each customer remains active 
would (if known) appear to have a random component, that is, not all customers become 
inactive at the same time. Clients may become inactive for any of several reasons, e.g., 
because they no longer need the product/service being offered, they have switched to 
another supplier, or they have themselves gone out of business. 

(2) While active, customers purchase whenever they wish, with interpurchase times 
also appearing to have a random component, i.e., purchases do not happen perfectly 
regularly in time. 

(3) When a customer becomes inactive, the selling firm is not notified directly (or at 
least that information is not recorded in the customer base). Thus, the only (and partial ) 
evidence that a customer may have become inactive is a suspiciously long hiatus since 
the last observed purchase. 

(4) Customers may differ from each other in both their purchase rates (while active) 
and in their “dropout” rates (i.e., their propensity to become inactive customers). These 
two rates are treated as varying independently of each other across customers. 

The need for a probabilistic modeling approach as in SMC stems most directly from 
the nonnotification (#3) and random purchasing (#2) features. If the selling firm is 
generally notified explicitly (as in terminating a long-distance telephone company ), then 
the number of active customers is known directly, without any analysis. Further, even 
without notification, if purchases were made perfectly regularly while a customer is active, 
dropout would be evident as soon as this customer’s average (and constant) interpurchase 
time is surpassed. We do not wish to oversell the range of applicability of this paper’s 
modelling effort. Industrial markets are extremely diverse, and certainly many do not 
possess the characteristics listed above. But on the other hand many industrial markets, 
including the one to be analyzed in this paper, are well approximated by the four fea- 
tures above,! 

Although we view the SMC model as an appealing framework for answering the ques- 
tions posed earlier for many industrial marketers, its original presentation suffered from 
several drawbacks. The application reported in this paper is intended to overcome what 
we see as the four most major of these. 

1, Empirical Validation. No empirical evidence with the SMC customer transaction / 
retention model has yet been published. We rectify this with our empirical application. 
The results provide evidence that the model captures the major characteristics of the 
purchase / dropout process, yields useful insights into the underlying process followed by 
customers, and has well-calibrated forecasts of future purchase patterns. 

2. Parameter Estimation. The method suggested in SMC to estimate the model’s 
parameters proved unreliable, so we have developed an improved approach in this paper. 

3. Sampling Properties. Our bootstrap standard errors allow us to test for the existence 
of customer subgroups and provide some tentative guidelines for sample size and sample 
construction (e.g., is it better to use more customers from the available database, or fewer 
customers for a longer observation period?). 

4. Dollar Volume of Purchases. The SMC model considers transactions only—not 
total dollar volume generated. The distinction between the two is particularly severe in 
industrial markets, where customers do not typically purchase just a single quantity of 
a standard item offered at a standard price. We extend the SMC model to incorporate 
dollar volume of purchase. 

Besides these specific contributions, we hope to illustrate the potential for application 
of probabilistic models to the kinds of customer bases that industrial marketers generally 
actually possess. 


' Schmittlein et al. (1987) describe in more detail the informational and market /environmental characteristics 
needed to use this type of model. 
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Before proceeding to the model development and empirical results we should say a 
few words about the organization and decision-making context for which this article’s 
application was constructed. The company manufactures and markets a variety of rec- 
ordkeeping systems, business forms, stationery, printed forms, and so forth through a 
nationwide network of independent distributors. These distributors typically have exclu- 
sive geographic areas in which to market the firm’s products. Moreover, the distributors 
are contractually prohibited from carrying competing products. For the most part, the 
firm’s product line consists of consumable items, many of which are replaceable with 
competitors’ offerings. 

There were several “needs” that collectively motivated this research. Although the 
firm had complete purchase records on its customers, and although it would archive 
customers after a certain period of inactivity, it wanted an independent, objective method 
for estimating the number of customers it actually had. This desire related to two issues 
the company was attempting to resolve. One issue was whether it should attempt to 
establish a direct distribution channel to its current customers. In other words, the firm 
was considering a couple of direct marketing approaches (e.g., mailing catalogs to cus- 
tomers and/or taking their orders directly and/or instituting a telemarketing operation ) 
that would operate independently of (and, in one sense, compete with) the distribution 
system currently being used. For a direct marketing channel to be effective, it would be 
important to establish the timing of contacts (i.e., if and when should the customer be 
sent a catalog), estimate the value of a customer, etc. 

The second issue related to the value of specific customer subsets. In particular, dis- 
tributors effectively “own” their customers or accounts in their geographical area. Some- 
times distributors attempted to sell their accounts to other entities or back to the firm or 
the firm attempted to buy out distributors. Both of these actions require an objective 
assessment of the value of a particular customer base. At the time of our research, the 
firm used a rule-of-thumb in arriving at the value of a customer base. This rule-of-thumb 
typically produced results that were not acceptable to either the distributor or the company 
and, consequently, all too frequently resulted in litigation. 

The modelling effort reported here was designed to address both of these key issues, 
i.e., coordinating a direct marketing effort, and assessing the value of any particular group 
of current customers. As background to this application, the next section provides an 
overview of the customer purchase /retention model proposed in SMC, followed by de- 
velopment of a new parameter estimation method. Subsequently, we present an extension 
of the SMC model to include explicit analysis of the dollar purchase volume. This applies 
to industrial applications where a single purchase (order) can involve multiple different 
products at a nonstandard / negotiated overall purchase price for the order, and serves as 
the basis for estimating the value of a specific customer or customer group. After describing 
the industrial customer database available for this study, we present the empirical ap- 
plication for our extension of SMC’s transactions model. This includes individual cus- 
tomer purchase forecasts, aggregate forecasts, customer base segmentation, and sampling 
issues. Finally, we validate the extended SMC model in several ways. First we examine 
customer reorder patterns to assess the dropout phenomenon assumption above. This 
assumption regarding the ability to infer active /inactive customer status is then put to 
a second test via a telephone survey to a subset of the customer base analyzed. Lastly, 
we cross-validate the model’s individual customer-by-customer predictions of the number 
of reorders, and the total dollar volume of reorders, in a future time period. 
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To answer the managerial questions posed in the Introduction, we rely on the observed 
history of customer transactions with the firm of interest. This history is interpreted 
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through the prism of the customer transactions / customer retention model described here 
to provide the desired insights and predictions. Schmittlein et al. (1987) derived the 
properties we will need for the basic model (ignoring, for the moment, the dollar volume 
of transactions). They also discuss at length the sort of information and market conditions 
needed for its application. The two key conditions for this model to hold are that (1) 
customers can reorder whenever they wish, and (2) the time at which a customer becomes 
inactive is not directly observable by the supplying firm. Here we will simply review the 
model’s assumptions and list the key equations. 

The decision to base customer inferences and predictions specifically on past transaction 
patterns is a deliberate one. The major reasons are 

(1) past purchase patterns are widely available, 

(2) past purchase patterns tend to be effective predictors, and 

(3) past purchase patterns can be rich predictors. 

Point (1) was discussed earlier. The relative ability of past purchase behavior to capture 
subsequent purchase propensities is well known, especially in direct marketing, where it 
generally outpredicts other geodemographic customer information available. Similarly, 
choice modelers often find purchase history to be much more predictive than marketing 
mix variables such as price or promotions (Fader and Lattin 1993; Guadagni and Little 
1983). In spite of this, one might well ask ‘“‘why not include past purchase, geodemo- 
graphics, and marketing mix efforts as joint predictors of future purchases?” Indeed, we 
are as interested as anyone in the latter two effects; and, lacking any relevant empirically- 
validated theory, we would join those adding the latter effects to the right side of ad hoc 
regression or logit-type predictive models. But we do not lack such a theory for the 
implication of past purchases. Accordingly, we use the theory to detail the unique role 
played by past purchase patterns both to indicate very specifically their link to inferences 
and predictions of interest, and to form a rich framework in which various kinds 
of geodemographic and marketing mix effects can be considered. This brings us to 
point (3). 

It would certainly be possible to calibrate a model predicting each of the following 
quantities using a sensible set of purchase history information (e.g., purchase frequency, 
timing of most recent purchase ) and some convenient (e.g., logistic) model formulation: 

(a) the number of transactions made by a customer in the next 7, months; 

(b) the probability that a customer will make at least K transactions in the next 7> 
months, and 

(c) the number of months that will elapse until the customer’s next transaction. 
Unfortunately, calibrating a model for (a) tells us nearly nothing about (b) and (c), and 
vice-versa. Further, the analysis must be completely redone every time the cutoff K or 
time period (7,, T>) of interest changes. 

In contrast, the modeling approach exemplified in SMC provides predictions for all 
the quantities (a)-(c), for each value (K, 7,, 72) from a single parsimonious set of 
parameters. In addition, those parameters provide convenient “‘entry points” for ex- 
amining the impact of geodemographic or marketing mix variables. Specifically, we can 
envision these factors affecting a customer’s purchase rate, dropout rate, or both. This 
type of extension to the basic model is in the spirit of Gupta (1988, 1991), Jones and 
Zufryden (1980), and Wagner and Taudes (1986), all of which incorporate exogenous 
variables in a richly predictive model responsive to the special role played by past purchase 
patterns in repeat purchase processes. We will return to such extensions after describing 
mathematical particulars of the SMC model, and provide an empirical application in 
the segmentation results to follow. 


SMC Model Assumptions 


1. Transactions by Active Customers. While active, any customer makes transactions 
with the firm of interest that are randomly distributed in time, with some customer- 
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specific long-run transaction rate \. There is substantial support in consumer markets 
for the effectiveness of this assumption (Ehrenberg 1988; Morrison and Schmittlein 1988). 
In industrial markets some indirect support is provided by Uncles and Ehrenberg (1990). 
While active, then, the number of transactions XY, made by a customer in time period 
of length ¢ is a Poisson random variable: 
x 
px=xir,q= SP om, x=0,1,2,.... (1) 
2. Individual Customer Retention/ Dropout. Any customer is viewed as remaining 
“active” with the firm of interest for some (not directly observed ) time period of duration 
r. After that time (which may, of course, be very long), the customer no longer purchases 
from this firm. This customer’s dropout phenomenon is assumed to occur randomly in 
time according to some rate yu. That is, the customer, having been active for ¢ months, 
has a likelihood p of dropping out (becoming permanently inactive) in the next short 
time period, regardless of t. Thus, the time spent as an active customer is an exponential 
random variable, with density 


S(t|u) = pe“ tt > 0. (2) 


Via assumptions (1) and (2) each customer is characterized by two traits: a purchase 
rate \ and dropout rate p. 


3. Heterogeneity in Transaction Rates. Naturally, some customers have high purchase 
rates while others buy infrequently. This heterogeneity in transaction rates across cus- 
tomers is assumed to follow a gamma distribution, with p.d.f. 


a’ 


I(r) 


flr, a) = ATH ea; A>0, r,a>0. (3) 


The mean purchase rate across customers is E[A] = r/a and the variance is r/a?. The 
parameter r is an index of the homogeneity in purchase rates across customers. 


4. Heterogeneity in Dropout Rates. Not all customers drop out at the same time, or 
with the same rate u. In the SMC model, dropout rates « are assumed to vary across 
customers according to a gamma distribution (unrelated to (3) above), with p.d.f. 


Ss 


pte op > 0, 5, 8>0. (4) 


g(uls, 8) = T(s) ; 


The mean dropout rate is E[u] = s/6 and the variance is 5/8”. The parameter s is an 
index of the homogeneity in dropout rates cross customers. 


5. Rates \ and » are Independent. The purchase rate X and dropout rate yu are 
assumed to vary independently across customers. 


Key Mathematical Results 
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Several of SMC’s formulas will be helpful in estimating the four model parameters 
(r, a, s, 8) and answering the managerial questions raised in the Introduction. For a 
randomly chosen customer, the expected number of transactions made in T units of time 
following an initial purchase is 


s-l 
E[X|r, a 5, B, n=" 5[! - (543) i (5) 
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The variance, across customers, in the number of such transactions is 


Var [X|r, a, 5, 8, T] = E[X|r, a, 5, 8, T] — (EL[X |r, a, s, 8, T])’ 


4 art 8 B 8 ( B a B y 
e(s—1)|s-2 s—-2\B+T B+T , 


(6) 


Two additional results will suffice to provide the desired inferences and predictions. 
The first is the probability that a customer with a particular observed transaction history 
is still active at time T since trial. SMC show that this probability depends on the customer’s 
past purchase history only through the number of purchases X and the time ¢ (since trial) 
at which the most recent transaction occurred. Thus we denote this desired probability 
as P[Active|r, a, s, 8, X, t, T]. Its formula is given in Appendix Al. This result for 
P[Active] enables us to compute the expected number of active customers at any point 
in (calendar) time, and to rank customers with respect to the likelihood that each is still 
active. P[ Active] has some natural application contexts, e.g., in targeting customers who 
have become inactive recently. 

Other applications focus more on forecasts of future purchases. (Note that, at some 
time, a customer may have only a small chance of still being active, but if active, have 
a high expected future purchase rate.) In these cases we are interested in the expected 
number of future purchases X*, to be made in some future period of length 7*, for a 
customer with observed purchase history (X, t, 7) as above. SMC show that this expec- 
tation has a very simple form, namely 


EL[X* |r, Q, 5, B, X, t, T, i*] 
= P[Active|r, a, s, 6, X, t, TJE[X|r*, a*, s*, B*, T*] (7) 


where r* = 7+ X,a* =a+T,s* =5, B* = 8 + T, and the expectation on the right- 
hand side of (7) is that given earlier in (5). In other words, after seeing history 
(X, t, T), we only expect to see future purchases if the customer is still active. Jf the 
customer has not dropped out, the process acts as if it starts over at time 7, but with 
new (updated) parameter values r*, a*, s*, and 6*. 

Equation (7) allows us to predict future transaction levels either separately customer- 
by-customer or for pooled sets of customers. In the latter case, we can forecast the total 
number of purchases through any future period 7* of interest, for an entire set of current 
customers of the firm. It reveals, in particular, the number of new customers that must 
be acquired by the firm (e.g., by its salesforce) to reach a given sales target. This is 
particularly important in industrial marketing, and is nor directly available from aggregate 
sales data alone (due to the dropout phenomenon that is not directly observable). SMC 
include additional properties of the model discussed here, and further discuss the link to 
specific management decisions. 

We also mentioned in the abstract that the SMC model can be used to predict how 
much longer a customer will remain active. For one just placing an initial order, this is 
just the aggregate distribution for dropout time, i.e., the mixture of Equation (2) with 
Equation (4) as prior. This gamma mixture of exponential random variables is a Pareto 
distribution, with a discussion and its formula given by Morrison (1978) and Schmittlein 
and Morrison (1983b). After observing a customer for some time, the expected remaining 
time as an active customer is either zero (if the customer is not now active), an event 
with probability 1 — P[Active], or is simply an updated mixture of Equation (2) with 
(4); 1.e., the same Pareto distribution as above, but with the updated dropout process 
parameters (s and 6) that were given for Equation (7) above. 
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One final characteristic of the SMC model merits explicit mention. In some business- 
to-business markets, customers deal with multiple suppliers and switch among them 
from order to order. One might ask if such switching (not directly observable by the 
supplying firm) will invalidate the SMC model. That is, while a customer remains active 
will reorders from this supplier still follow the NBD model (Ehrenberg 1988), when other 
suppliers are also used? Schmittlein et al. (1985) show that the answer to this question 
is “yes.” That is, under some conditions the NBD will be completely unaffected by this 
switching; and even if such a condition does not hold precisely, the NBD is very robust 
to switching among multiple suppliers. 


3. Extensions of the SMC Model and Methodology 


Here, we describe an improved estimation methodology for the SMC model, and then 
extend that model to incorporate the dollar volume of purchases. 


A Two-step Parameter Estimation Method 


In estimating the four model parameters (r, a, s, 8), SMC proposed a three-step 
method of moments procedure. That approach relies on the ratio of values for Equation 
(5) above for time periods of different length. They did not actually apply the approach 
to any purchase data, and in our experience these ratios for different (7, Tz) values are 
too unstable to provide reliable parameter estimates. 

As an alternative, we suggest the two-step method of moments estimation detailed in 
Appendix A2. It is much more tractable than maximum likelihood estimation, which 
would require multiple evaluations of the Gauss hypergeometric function for each reorder 
in the likelihood function—see (Al) and (A2). The first step will extract as much in- 
formation as possible out of the first moment (i.e., average observed reorders), since 
lower-order moments usually have better sampling properties. This produces an estimate 
for three of the four model parameters. The fourth is estimated by fitting a second moment, 
i.e., the observed variance in reorders. Bootstrap estimates of these parameter sampling 
variances are reported in the results to follow. 


Extending the SMC Model to Incorporate Dollar Volume 


We have discussed the importance in industrial markets of extending SMC’s model 
to incorporate explicitly the dollar volume of orders. Equation (7) provides a prediction 
of future transactions for each customer, and one simple strategy would be to multiply 
this expectation times the average dollar volume of past reorders made by this customer. 
Notice that we now characterize each individual customer by three latent traits: the 
transaction rate (X), dropout rate (4), and average dollar volume per reorder (@). 

There are two problems with the simple strategy above. The first is what to do with 
those accounts that happen not to have reordered yet—what average dollar volume per 
reorder 6, should be expected of such customers? One solution to this first problem is to 
substitute the average reorder volume for all customers who have reordered, using this 
overall average E[@], as our estimate 6 for each customer with no reorders. But using 
this solution brings us to the second problem with such a simple strategy. Having used 
the behavior of other customers to estimate 6 for some customers lacking any reliable @ 
using past purchases (i.e., those without reorders), why not use this overall E[6] to help 
estimate 6 for any customers where § based on past purchases is unreliable (e.g., those 
with only one or two reorders)? We will propose a model for reorder quantities that 
enables us to do just this, and so provide a more stable and accurate estimate of future 
reorder volume @ for each customer. 

For any phenomenon with a random component, such as the volume purchased from 
occasion to occasion for a particular customer, we know that in anticipating future order 
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volumes, we should expect a regression effect toward the average across-customer volume 
E[6] (Schmittlein 1989). That is, a customer whose observed historical reorder dollar 
volumes were relatively low was probably a bit “unlucky” on those particular occasions. 
The expected future volume will generally be greater than the historical average for this 
customer but still less than the overall population average E[6]. The opposite occurs for 
customers with high observed historical order sizes. Further, this statistical regression- 
to-the-mean effect is more pronounced when the customer’s historical average order size 
is based on only a small number of observations. We propose a model—that can be used 
in conjunction with the SMC model—to anticipate this regression effect and provide an 
effective estimator of 9 for each customer. Then individual customer volume predictions 
can be formed by multiplying this estimator by SMC’s Equation (7) for transaction 


frequency. 
For a particular customer observed to have made _X reorders to date, let Z, denote the 
dollar volume of order i (i = 1,..., X). Then of course 


E{Z;\6] = 6 for all i. (8) 
Our objective is to compute the reverse conditioned expectation 
E[@|Z,,..., Zx] (9) 


which will be the expected dollar volume for future reorders by this customer. As SMC 
did for the transaction/retention process, we use three sets of assumptions that will 
enable the updating indicated in (9). 


(1) CUSTOMER-LEVEL ASSUMPTION. We assume that the set of Z;,i = 1,...,X 
are i.i.d. normal random variables with mean 6 and some variance oj which is constant 
across customers. o4y is thus the within-customer variance in the amount spent across 
reorders. 


(2) HETEROGENEITY ASSUMPTION. The average amount spent per order 6 is assumed 
to vary across customers according to a normal distribution with mean E[6] and variance 
a4. «4, is thus the variance in average amount spent across customers. 

(3) INDEPENDENCE FROM THE TRANSACTION/ RETENTION PROCESS. The distri- 
bution of average amount spent 8 across customers is assumed to be independent of the 
distribution of the transaction rate » and the dropout rate q. 


We will motivate these assumptions shortly, but first we present the resulting formula 
for the conditional expectation of interest (9). With Assumptions (1) and (2), the con- 
fidence that one should place in a single observed past order amount Z, (relative to 
relying on the population average E[6]) is the reliability coefficient 


o4 


ed ee 1 
Pi oi tay (10) 


Then if only _X = | previous order was observed, the best estimate for @ as in (9) becomes 
(Schmittlein 1989) 


E[@|Z,] = o1Z, + 1 — 9) E94]. (11) 


In general, if X = 1 historical reorders were observed, the reliability of the historical 
average 


x 
Z= > Zi (12) 


cs 
xX 
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is 
2 


oi AS (13) 
Ox 4 + (aw/X)’ 
and the expected future volume per reorder (9) is 
Xo3 = o%, 
E(é@|Z,,..., Zx] = (sa ZZ + 1 SO ETS). 
[ | 1 x] (etx)? (ata) [9] (14) 


To compute an expected future dollar volume from a customer with a given purchase 
history (X, t, T, Z), we multiply the updated volume/reorder estimate (14) by the 
expected future number of transactions (7). 

In arriving at the key Equation (14), Assumptions (1) and (2) were convenient but 
not necessary. That formula can also be derived from the standpoint of minimizing 
squared prediction error in 6 without the normality assumptions (Gerber 1979, Chapter 
6).? Finally, Assumption (3) is a “real” constraint in that we need to isolate the updated 
é-value in (14) from the updated transaction/retention process in (7) (and keep the 
mathematics tractable). Its validity may vary from application to application. In the 
validation section below, we show that independence of reorder dollar volume from 
typical interpurchase time is tenable for the current firm’s customer base. We next describe 
that base of customers in more detail. 


4. An Industrial Customer Database 


The firm whose customer base is analyzed here sells a variety of office products to 
business customers. We examined transactions made through May 1989. At that time 
this firm maintained purchase records for several hundred thousand customers with 
whom it had ever done business. Thus, the firm had access to (and retained) detailed 
information on each transaction made by each customer. Each record in the database 
details a sale of some quantity of a particular product to a given customer shipped on a 
certain date for a particular dollar amount. The key transaction of interest is not, however, 
the individual product record but rather an order placed by a customer on a certain date. 
An order frequently includes multiple different products, and although the product choice 
and quantity decisions could themselves be the focus of interesting modeling, we consider 
only the overall order and its dollar volume in this paper. These two quantities (order 
date and dollar volume per order) are the ones that generalize most easily across industrial 
markets, coincide with the SMC framework, and provide the basis for forecasting the 
future customer activities of most interest. 

Also accompanying each purchase record is that customer’s entry date into the database, 
i.e., the time at which the customer’s initial order was placed. This is important since 
purchase records are kept by the firm covering only a “window” of slightly over three 
years (generally March 1986 through May 1989 in our case). For customers placing an 
order within this period, we know all transactions made within the window, and know 
the date of initial purchase (even if before March 1986), but do not know about any 
other orders these customers may have placed prior to 1986. Further, for customers that 
place no orders at all within the window here, no information at all is available (i.e., not 
even their entry date into the system). These individuals have already been dropped 
from the current customer database maintained by the firm, using an ad hoc heuristic 
(i.e., the three-year rule above). Like the firm, we do not see these omitted customers 
as providing significant future sales—a conclusion also supported by our empirical 
results below. 


? The least-squares criterion for generating (14) does not, however, provide an estimate of the entire conditional 
distribution for 6, which the normal distribution assumption does supply. 
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Returning to customers entering before March 1986 and included in the database, we 
need to reset / update the four gamma distribution parameters for each such customer as 
of March 1986. We do so to include the relevant information on their purchase history 
prior to that date. Since no information on orders placed in the pre-March 1986 period 
is available, the purchase rate parameters (r, a) are the same in March 1986 as if an 
initial purchase had been made then. But the time since the entry date does provide 
information on the dropout parameters (s, 8), since it is known that the customer passed 
through the period (entry date, March 1986) without becoming inactive (since a trans- 
action was made post-March 1986). So the parameter 8 is increased by the amount of 
elapsed time between the entry date and March 1986 for these customers. With this 
updating, modeling and predictions for these individuals in the post-March 1986 period 
proceeds normally, i.e., in the same fashion as for those customers (entering post-March 
1986) whose entire purchase history is observed. For additional discussion of this “left 
filtering” of event histories which is common in industrial databases see Schmittlein and 
Morrison (1983a). Approximately 60 percent of the customers in the database analyzed 
here entered prior to March 1986, and were thus left-filtered. 

For this firm the average order size is approximately $100, with a substantial variation 
both across customers, and within customers across orders placed. Examining each cus- 
tomer’s repurchase activity after one (randomly chosen) transaction, the median inter- 
order time is about seven months. 

Consistent with the various purposes and requirements of the analyses to follow, several 
different subsets of the overall customer base will be used extensively. These customer 
groups, and the number of customers contained in each, are listed in Table 1. The entire 
customer base (A) had over 400,000 customers (over 4.5 million individual purchase 
records). The authors initially had access to a 1/20 sample (B), i.e., every twentieth 
customer, from this base. Within the sample (B) 17,825 customers (C) were observed 
for at least one full month. Also within the customer set (B), 1829 customers were not 


TABLE | 
Summary of Customer Groups Used in Analyses 


Customer 
Group Description 
Total set of customers 


Customers in sample 

Set B customers observed for 
at least one full month 

Set B customers not left- 
filtered and having 30+ 
months of observed 
purchase history 

Set D customers observed to 

have placed at least two 

reorders 


Number of Customers 


Universe 400,000+ (4.5+ million 


purchase records) 


1/20 Sample 


17,825 


1,829 


1,056 


1/10 Sample Customers in sample 40,000+ (450,000+ 


purchase records) 


Set F customers not left- 4,050 
filtered and having 30+ 
months of observed 
purchase history 

Random subset of set F 5,030 


customers tracked for 
extra calendar time 
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left-filtered and had 30+ months of observed purchase history following placement of 
their initial order (D). Finally, 1,056 of the set (D) customers were observed to have 
made at least two reorders (E), and were therefore used in estimating the distribution of 
reorder volumes. Subsequently we were given access to a larger (different) 1/10 sample 
(F). Within this set were 4,050 customers that were not left-filtered, and that had 30+ 
months of observed purchase history (G). Sets (F) and (G) were tracked from 1986 
through 1988. A random subsample of (F), consisting of 5,030 customers (H), was 
followed further in time, through May 1989. 


5, Parameter Estimates 


Parameter Values, SMC Model 


The four parameters of the basic transactions model were estimated for a representative 
sample of N = 4,050 customers (set (G) in Table 1) who each had at least a 30-month 
observed purchase history. With the least squares two-step estimation described above, 
the parameter values are: 


all customers (30-month history): transaction process: r= 0.91, a@ = 3.70; 
dropout process 5 = 0.21, 6 = 0.29. 


From the transaction process parameters, we see that the mean purchase rate while 
the customer is active, is r/ = 0.25 per month, or about three times per year. We should 
emphasize that even a simple statistic such as this cannot be calculated directly, since 
we do not know when the customers who drop out have left. An r-value of 0.91 represents 
a moderate level of heterogeneity in transaction rates across customers (Schmittlein et 
al. 1992). 

The estimated average dropout rate is 5/8 = 0.72. For a customer having that rate, 
the probability that the account becomes inactive in the next month, given that it has 
not yet become inactive, is | — e~°’? = 0.52. It is highly unlikely that such an account 
would “survive” beyond a year. But a second characteristic of the dropout rate distribution 
suggests that few customers are actually near this mean rate of 0.72. Recall that s is an 
index of homogeneity in dropout rates: a value of s = 0.2 is very small. (For example, 
this gamma distribution has much fatter tails than the exponential distribution.) Thus, 
some customers have very small dropout rates (and are likely to remain with the firm 
for a long time) and the rest have quite large rates, becoming inactive within months of 
initial trial. 

These inferences—at least for the long-time customers—seem reasonable. The presence 
of substantial early dropout can also be seen by even a cursory examination of the database. 
By way of illustration, we reproduce below the records for the first ten customers analyzed: 


Number of 
Reorders 
Observed 


Most Recent 
Reorder 


Customer 
No. 


P[Active] 
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1 Pre-3/86 15 99 
2 Pre-3/86 2 18 
3 5/86 1 99 
4 8/86 2 38 
5 10/86 6 95 
6 12/86 0 15 
7 12/87 4 91 
8 3/87 0 .16 
9 5/87 1 05 
10 Pre-3/86 2 1 
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08 Customer C 
° 1 reorder 
at 12 months 
06 7 


Customer B 
PiActive] 1 reorder 


at 6 months 


0.4 


Customer A: 
0.2 no reorders 


0 5 10 15 20 25 30 
Months Since Previous Transaction (Hiatus) 


FIGURE 1. Illustrative Relationships Between Order Hiatus and P[ Active]. 


Three customers (#1, 2, 10) were left-filtered. Among this set of customers half seem 
to be still active (#1, 3, 5, 7, 10) and the other half are probably inactive. This can be 
seen by examining either the right-most column (the model’s P[ Active], computed by 
Equation (A1)) or the customer history itself. All customers appearing inactive based 
on P[ Active] (#2, 4, 6, 8, 9) made 0, 1, or 2 reorders, typically soon after an initial 
order, and then did nothing for a long time period. In contrast, customers 1, 3, 5, and 7 
exhibit a final hiatus that is consistent with their respective average interorder times. 
Finally, customer 10 is a bit unusual. The account experienced a long hiatus, but is 
viewed by the model as probably still active. The reason is that this customer has been 
around for a very long time (not reported in the table above), and thus buying one or 
more times in the window 3/85-—5/89 is sufficient to conclude that the account is still 
active. 

Based on these examples, judging a customer’s active/inactive status may seem easy 
after the fact, i.e., after seeing the value of P[ Active] above. But judging it 4 priori is not 
nearly as easy; and finding a simple ad hoc index that can be applied automatically in 
place of the SMC model is even less so. For example, one simple heuristic used by many 
firms is the duration of the hiatus (7 — ¢) since the most recent order. Some cutoff (e.g., 
a hiatus of two years) is routinely used to characterize a customer as “inactive.” (An 
illustration of this heuristic’s use with frequent flyer programs is given in The New York 
Times (1988).) 

But such “hiatus rules” are not really effective at capturing major qualitative features 
of dropout, as we show in Figure 1. Following an initial order, P[Active] does indeed 
drop with the length of hiatus, i.e., the time since that order for a customer (e.g., customer 
A in Figure 1) with no reorder. It also drops with hiatus length for customers who 
repurchased at either six months (customer B) or 12 months (customer C) after an initial 
order. But across customers with equal hiatus lengths, P[ Active] can still vary greatly. 
Using our estimated SMC model parameters, when it has been 20 months since the most 
recent transaction, customer A’s probability of still being active is only 0.2, while it is 
approximately 0.4 and 0.6 for customers B and C, respectively.? It does not seem to us 
an easy matter to construct a simple index likely to replicate the SMC model very well. 

In summary, both the parameter values and the illustrative customer histories indicate 
that while accounts can (and do) become inactive at any time, most customers either 
drop out relatively early or stay with the firm for an extended period. The four parameters 
estimates have substantial face validity and are consistent with the sample histories pro- 
vided above. 


3 Note that the 20-month order hiatus is more “suspicious” in signaling inactivity for customer B than for 
customer C, since C’s transaction rate appears to be lower than B’s. In other words, long interorder times are 
more consistent with customer C’s behavior, than they are for customer B. So one would expect P[ Active] to 
be smaller for B than for C. 
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Sampling Properties and Sample Design Issues 


We turn next to some sampling properties of the four SMC model parameters. No 
evidence on this issue has appeared previously. We first consider the sensitivity of the 
parameter estimates to the cutoff number of months Ty, following initial purchase for 
which reorders were tracked. The top of half of Table 2 shows the parameter estimates 
obtained as Ty, varies from 12 months to 30 months. We do see some variation in the 
parameters, mostly reflected in the average dropout rate (s/8) which seems to decline 
as customers with longer purchase histories are analyzed. 

The bottom half of Table 2 shows how the parameter estimates change when 7}, is 
held constant (at 12 months) but the cohort group of customers varies. The order of 
entry into the customer base proceeds from top to bottom in this part of the table: thus 
the first row shows estimates using 12 months after trial for the “older” customers— 
those entering April 1986-November 1986. The bottom row shows estimates for newer 
customers (that also have 12 months of data following trial). The variation is not sub- 
stantial, although there may be a slight downward trend in the reorder rate (7/ a) among 
later entrants. 

Another important issue is the sampling variation in SMC model estimates. Table 3 
provides estimated standard deviations and correlations from 100 bootstrap replicates 
(Efron and Gong 1983) of the sample in row 1 of Table 2 (NV = 4050). As a percent of 
the parameter’s value, we note that the average dropout rate (s/G) is the quantity least 
reliably estimated. (The standard deviation is approximately 47% of the parameter’s 
value.) This makes sense, since our only information about dropout is obtained indirectly, 
through the time pattern of reorders. We also note, however, that the (high) degree of 
heterogeneity in dropout rates across customers, indicated by the small value of s = 0.21 
in Table 2, is very reliably estimated. The standard deviation of s across bootstrap samples 
is only 0.01. The transaction rate parameters while a customer is active (r/a) are also 
reliably estimated. 

Of the 15 parameter correlations listed, only three (7, 5), (a, 5), and (r/a, a) are not 
significant at the 0.01 level. As one would expect, the (7, a) and (s, 8) correlations are 
positive, indicating that the estimation is particularly sensitive to attempts to fit the mean 
transaction rate r/a and mean dropout rate 5/8. The other correlation of interest is that 
between r/a and s/8. The high positive value (0.71) presumably stems from the role 
that each quantity plays in determining the expected number of transactions made in 
some time period. That is, a high predicted number of transactions can stem either from 


TABLE 2 
Sensitivity Analysis of SMC Model Estimates: Duration Time Cutoff and Cohort Effects 


Months SMC Model Estimates 
of Data Number of 
Available Customers* 


* All customers entered on/after March 1986 and so provided a complete record of transactions made in the 
months following initial purchase. This is Customer Group G mentioned in Table 1. 
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TABLE 3 
SMC Model Sampling Properties: Bootstrap Estimates of Correlations and Standard Deviations 


Standard Deviation 


[aie sie ae ey | cae 


Note: Table based on parameter estimates from N = 4,050 customers tracked over 30 months since placing 
an initial order. This is Customer Group G mentioned in Table 1. 


Coefficient of 
Variation 


a high transaction rate while active (r/a) or a low dropout rate (s/). In fitting the 
observed number of transactions across bootstrap samples, sometimes the transaction 
rate r/a is increased, with a counterbalancing increase in the dropout rate s/£. 

Finally, we consider one sample issue related to sample design. Calculating SMC model 
parameters across all transactions of all customers in an industrial customer base can be 
difficult, time consuming, and unnecessary. Recall that our subsample of this firm’s 
customer base (an every 10th customer sample) contained over 40,000 customers and 
approximately 450,000 purchase records. While we may be interested in predicting future 
potential separately for all 400,000+ customers (not an overly difficult task), we are not 
sanguine about computing and fitting transaction dynamics across 4.5 million purchase 
records. 

One can think of the sampling frame as having both a “space” (i.e., across customers ) 
and a “time” (amount of time each customer is watched} component. The dataset size— 
i.e., number of purchase records—goes up by expanding either. Real-world applications 


PURO CONFOENCE NTERNAL 


FIGURE 2. Sampling in Space versus Time. 
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of the SMC model need to make some space/time tradeoff; e.g., would it be better to 
analyze 12 months of transaction data from 30,000 customers or 30 months of data for 
15,000 customers (assuming both produce the same number of purchase records to 
analyze)? 

These kinds of questions can be addressed using the bootstrap method applied earlier. 
That is, we can see how the standard deviation of a particular parameter of interest 
changes as the space/time configuration is varied. Of course, this analysis requires actually 
estimating that parameter in each space/time combination of interest. But doing such 
an analysis once (or for some hypothetical SMC parameter values) will be sufficient to 
guide future sample design decisions. 

Figure 2 shows the accuracy of (r/a)’s estimate as the # customers /amount-of-time 
combination varies. Naturally, the standard deviation decreases as each of space and 
time are expanded. More interestingly, we see that accuracy declines very rapidly (i.e., 
the confidence interval expands) as the number of months decreases below 12 nearly 
regardless of the number of customers analyzed. We would conclude here that “good” 
sample designs for estimating r/a should attempt to include a minimum of 12 months 
of reorders and, 1,600 customers, and then as much additional space/time as computa- 
tionally feasible. 


6. Using the Extended SMC Model 


To demonstrate the use of our purchase volume model (14) with SMC’s customer 
transaction/retention model (7), we prepared a variety of five year predictions. The 
predictions cover the period beginning with the end of our calibration window (May 
1989) through April 1994. 

The SMC model parameters were those given at the beginning of the last section. In 
order to extend the model to dollar volume of reorders (e.g., to apply the updating 
formula (14)), we also need an estimate of oy, 64, and E[6]. The within-customer 
variance in amount spent across reorders can be estimated separately for each account 
observed to place at least two reorders. For a sample of 1,056 such accounts (i.e., set (Z) 
in Table 1), the average value of ofy is 6,059. For these same customers, the variance 
in @ across accounts is not directly calculable since we do not observe 6. But the total 
variance in amount spent across both reorders and customers (which equals ow + 04) 
is available, and equals 11,974. This quantity is computed for the same 1,056 accounts 
above using one reorder per customer, that reorder being the most recent one in the 30- 
month period since initial trial. Doing so results in o7, being estimated as 11,974 — 6,059 
= 5,195. The reliability coefficient p, is then 


a — 5,915 _ 
oi t+ a% 11,974 


pi = 0.49. (15) 


For a customer base where E[@] = $100, imagine that a customer was observed to 
place one reorder at $150. Using our variance estimates in (14), our best guess regarding 
this customer’s future reorder size is $125. Similarly, a customer placing one reorder at 
$40 has an expected reorder quantity of $70. Finally, a customer observed to have made 
four reorders with an average size of $150 has an expected reorder size of $140. 


Individual Customer Forecasts 


We first show some illustrative customer predictions in Table 4 for the same ten cus- 
tomers we examined earlier. The expected number of reorders is computed using Equation 
(7). We are able to translate that into a dollar volume projection via (14), listed in the 
right-most column of the table. Note the dramatic differences across customers in future 
value due to substantial differences in reorder rates, dropout rates, and reorder sizes. Our 
expanded SMC model provides a mechanism for combining these various elements of 
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TABLE 4 
5-Year Predictions for Illustrative Customer Accounts 


Observed 
No. of Average Expected 
Customer Reorders Reorder 5-Year $ 
# Observed | P[Active] Size ($) Volume 


Pre-3/86 

Pre-3/86 
5/86 
8/86 
10/86 
12/86 
12/87 
3/87 


1,521.88 
45.09 
424,32 
125.48 
1,148.68 
24.61 
703.49 
29.71 
15.52 
350.33 


$/87 
Pre-3/86 


COOMA NUABWN 
NK OROAN HK NW 


—_ 


customer history into an overall volume projection, for any time period (and duration) 
of interest. Note that prediction for another time duration (say, 24 years) is not obtained 
by merely rescaling the right column of Table 4, due to the progression of customer 
dropout during the period. One must recompute such a forecast via Equations (7) 
and (14). 

To demonstrate this last point and show the buildup of predicted reorder volume with 
time, Table 5 lists aggregate predictions during the five year interval discussed above. 
These expectations are for a set of 5,030 customers (i-e., set (H) in Table 1) whose past 
reorder patterns formed the basis for those predictions. The internal uses for such aggregate 
forecasts are obvious. But other less obvious contexts also exist. For example, various 
legal proceedings sometimes mandate one firm purchasing at fair value the current cus- 
tomer base of another firm. For one of several such examples, see Business Week (1988). 
By extension, the model can be applied to any friendly acquisition valuation decision. 


Customer Segmentation 


There was interest in whether certain a priori-defined customer segments differ in their 
transaction / dropout processes. In addressing this issue we will illustrate a simple method 


TABLE 5 
Forecast Volume Pattern Over a 5-Year Period 


Incremental Cumulative Incremental Cumulative Discounted 
Number of Number of Number of Dollar Dollar Cumulative 
Years Reorders Reorders Volume Volume $ Volume* 


0.5 406,683 406,683 391,006 
1 395,435 802,118 751,675 
1.5 387,251 1,189,369 1,086,748 
2 380,597 1,569,966 1,399,162 
2.5 374,914 1,944,879 1,691,117 
3 369,933 2,314,812 1,964,408 
3.5 365,492 2,680,304 2,220,562 
4 361,459 3,041,764 2,460,889 
4.5 357,788 3,399,552 2,686,566 

354,389 3,753,941 2,898,628 


* Assumes a 10% discount rate. 


Copyright © 2001 All Rights Reserved 


Downloaded from informs.org by [137.132.123.69] on 02 November 2015, at 06:14 . For personal use only, all rights reserved. 


RIGHTS 


58 DAVID C. SCHMITTLEIN AND ROBERT A. PETERSON 


for incorporating other explanatory variables (i.e., besides past purchase history) into 
the SMC model. 

The particular a priori segments were defined by type of business conducted, as indicated 
in the customer’s three-digit SIC code. Separate SMC model estimates for three such 
segments—medical offices, attorneys, and insurance-related films—are shown in Table 
6 and allow us to see which, if any, of the four conceptually distinct SMC parameters 
(i.e., r, r/a, s, and s/@) differ across groups. 

An analysis of the Table 6 data was conducted that essentially treats the four SMC 
parameters as functions of the nominally scaled SIC code in an ANOVA framework. 
The results reveal that each of the groups is unique in a certain respect. Medical offices 
have a particularly high transaction rate while active customers (7/a = 0.664). Insurance- 
related firms have moderate transaction rates, but the highest average dropout rates 
(s/8 = 1.62). Attorneys have the greatest variation across customers in dropout rates: 
they either quit very soon or stay a very long time. In contrast, attrition among insurance- 
related firms is more of a gradual and constant occurrence. (Each of these differences 
across groups is statistically significant at the 0.05 level. Standard errors were estimated 
for each group via bootstrap samples as in preparing Table 3.) 

These differences in dropout dynamics are sketched in Figure 3. Notice that each group 
rapidly loses a significant fraction of new triers to permanent inactivity. This dropout 
levels off quickly after 12 months, however. Further the level of dropout varies substantially 
across the three groups. After two years, 54 percent of the attorneys have become inactive, 
compared to 64 percent for medical offices and 77 percent for insurance-related firms. 
Finally, we note that while medical offices become inactive sooner than insurance-related 
firms, their overall reorder levels are higher than those of insurance-related firms through 
any given time period due to their greater transaction rate (while active). 

With respect to explanatory variable effects, we hope this analysis has convinced the 
reader that: 

(1) Purchase history is a special explanatory variable for which a specific model ought 
to be developed. 

(2) Customers can and do differ with respect to various aspects of the transaction / 
retention process. Each of these aspects in turn has its own unique impact on customer 
predictions of interest, e.g., P[ Active] or the expected number of future transactions. 

(3) The four SMC model parameters (7, r/a, 5, 5/8) provide conceptually distinct 


TABLE 6 
SMC Model Estimates for Different Customer Segments 


Insurance-related 
Firms 


3-Digit SIC 


No. of Customers* 


* To be included in the estimation, a customer must have entered on/after March 1986 and had 24 months 
of purchase data available. 


Serve 
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insurance-related Firms 
4 


Medical Offices | 
Proportion -_ 

of Customers : Attorneys 
No Longer Active . 


0123 6 12 18 24 36 


Months Since Initial Order 


FIGURE 3. Illustrative Customer Dropout Patterns for Three Customer Segments. 


and convenient “entry points” (i.e., dependent variables) for the other exogenous variables 
of interest. 


7. Validating the Model Assumptions and Model Predictions 


We first take a systematic look at the SMC model’s validity. Since the transactions 
component (i.e., a negative binomial distribution (NBD) model) has a fair amount of 
empirical support already (see Morrison and Schmittlein (1988) and Uncles and Ehren- 
berg (1990)), we will concentrate our attention on the customer retention /dropout com- 
ponent. We then will examine our assumption that the simple SMC model’s transaction / 
retention process operates independently from the dollar volume of reorders. Finally, 
we address the predictive validity of the entire extended SMC model. 


Validation of Dropout Phenomenon 


Since the dropout phenomenon is not directly observable, its presence (and represen- 
tation by the SMC model) must be examined through its impact on certain transaction 
patterns. One such pattern is the trend over (calendar) time in orders placed by a certain 
set of customers. We consider here a random sample of customers having an initial 
purchase date prior to April 1988 (i.e., possibly entering before March 1986 as well, and 
being left-filtered ). During the 12 months April 1988—March 1989, the number of orders 
placed by this static set of customers was: 


No. of Orders 


1. April-June 
2. July-September 

3. October-December 
4. January-March 


1,700 
1,670 


Such a drop is suggestive of customers becoming inactive. Of course, this kind of 
simple statistic may also be influenced by other factors, e.g., a peculiar seasonality in 
order placements or other calendar-time effects, such as rising prices. 

But other analyses tend to rule out these competing explanations and corroborate the 
dropout hypothesis. First, we note that the special (limiting) case of the SMC model 
without dropout—i.e., where all customers have a dropout rate 7 = 0— is the well-known 
stationary NBD model (Ehrenberg 1988). We fit the NBD to the observed distribution 


Copyright © 2001 All Rights Reserved 


Downloaded from informs.org by [137.132.123.69] on 02 November 2015, at 06:14 . For personal use only, all rights reserved. 


RIGHTS 


60 DAVID C. SCHMITTLEIN AND ROBERT A. PETERSON 


for number of transactions in a 30-month period, using data only from customers that 
placed a reorder (i.e., omitting the zero-class). We then use the NBD to predict how 
large this zero class should be, if there is truly no dropout of customers. If the actual 
number of nonbuyers is greater than the number forecast by the NBD, we have further 
evidence of the dropout phenomenon. 

The actual distribution of reorders, and the NBD model fit, was* 


Number of Reorders in 30 Months 


Observed Number of Customers 
NBD Fitted Number of 
Customers 


The NBD predicted number of nonbuyers over this 30 months is 418 customers, which 
is well below the 520 nonbuying customers observed. Therefore, the (stationary) NBD 
model substantially underpredicts the number of accounts that do not reorder—again 
consistent with the dropout hypothesis. Note that the 30-month histogram takes in a 
longer period of time than the quarterly order pattern presented earlier. Moreover, it 
covers different calendar time periods for different customers, depending on the customer’s 
entry date. 

One remaining alternative explanation for these results would arise from the presence 
of a substantial subgroup of customers who make an initial purchase and then never 
consider reordering. In other words, there is a hard-core subgroup that will never rebuy, 
and another (stationary ) group that will never drop out (and who follow the NBD model 
in reordering ). This model has some popularity and conceptual appeal (Morrison 1969; 
Morrison and Schmittlein 1988; Schmittlein et al. 1992), and its extra parameter (relative 
to the NBD) would indeed enable it to fit the high number of nonbuyers just encountered. 
Such a Non-User NBD (NUNBD) model does not allow for the continual dropout over 
time envisioned in SMC. Thus, like the NBD, the NUNBD is a stationary model for 
reorders: the “user” subgroup does not drop out. For any such model the incremental 
total number of orders placed in each successive time period after initial trial should be 
approximately equal. This provides a basis for distinguishing the NUNBD from the SMC 
model, which assumes that incremental orders decrease over time as customers drop out. 

The last column of Table 7 reports the incremental order pattern. It shows an unmis- 
takable decline consistent with the SMC model of dropout. Finally, we note that the 
SMC model does effectively capture these dynamics of the reorder process. Table 7 also 
shows the model’s fit to the buildup of cumulative reorders as time passes since the 
customer’s initial order. For example, by the end of the first year after trial, customers 
have repurchased 1.65 times on average. By the end of the second year, cumulative 
repurchases are 2.90—less than double the one-year total. This increasing-at-a-decreasing 
rate pattern again suggests that dropout is affecting the reorder level. Based on the results 
reported thus far, we conclude that the SMC model seems to capture effectively the 
customer transaction /retention process. 


Independence of the Transaction Process from the Dollar Value of Reorders 
We next validate the extended SMC model’s assumption that reorder dollar volume 
is not seriously affected by reorder timing. Table 8 contains the results of this analysis. 


4 The sample was N = 1,829 customers that entered both on/after March 1986 and before December 1986 
(i.e., (D) in Table 1). They could therefore all be observed for 30 months following their initial order. MLE’s 
for the NBD parameters were r = 0.824 and a = 6.00. 


ee OPYNIQNTS 2Z00T All RiGhtS RESSTVER 


Downloaded from informs.org by [137.132.123.69] on 02 November 2015, at 06:14 . For personal use only, all rights reserved. 


RIGHTS 


CUSTOMER BASE ANALYSIS 


Time Period 
Since Initial Order 
(Months) 


10 
li 
12 


13 
14 
15 


16 
17 
18 


19 
20 
21 


22 
23 
24 


25 
26 
27 


28 
29 
30 


TABLE 7 
Cumulative Reorder Levels Following an Initial Order 


Actual Cumulative 


Number of Reorders SMC Model- 
Per Customer* Placed Estimated Number 
Through Period of Reorders 
0.30 .20 
0.43 .37 
0.54 52 
0.66 67 
0.78 80 
0.89 94 
1.02 1.06 
1.15 1.18 
1,26 1.30 
1,39 1.42 
1.52 1.54 
1.65 1.65 
1.77 1.76 
1.89 1.87 
2.00 1,98 
2.10 2.08 
2.20 2.19 
2.32 2.29 
2.42 2.39 
2.52 2.49 
2.61 2.59 
2.71 2.69 
2.80 2.79 
2.90 2.89 
2.98 2.98 
3.07 3.08 
3.16 3.17 
3.25 3.27 
3.33 3.36 
3.42 3.45 


Actual Incremental 


Number of 


Reorders Within 


Each Three-Month 


Period 


0.542 


0.349 


0.374 


0.383 


0.351 


0.318 


0.290 


0.291 


0.263 


0.258 


* N = 4,050 customers with initial purchase after March 1986 and before December 1986. 


61 


Each of a set of 17,825 customers (i.e., set (C) in Table 1) was monitored following a 
(randomly chosen) reorder to record the timing of the next reorder (if such an event 
occurs). Column (b) lists the number of customers whose reorder occurred in month t 
following the previous order. Column (c) gives the number of customers who had not 
yet reordered at the start of month ¢, but who were observed during month ¢ and could 
have reordered. Thus the hazard rate (column (d)) for month ¢ is the fraction of those 
that could reorder who actually did (i.e., column (b) + column (c)). The C.D-F. in 
column (e) estimates the proportion of the population that reorders by each month. 
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TABLE 8 
The Reaction of Dollar Volume Per Reorder to Interorder Time 


(c) 
(b) # Customers (e) 
# Customers Who Could Cumulative 
Reordering Reorder Distribution 


(f) 
Avg. $ Amount 
Per Reorder 


(a) 


Wern~ AU a WH = 


Lastly, column (f) contains the average amount bought per reorder, for orders placed 
in month ¢. 

Our main observation is that the average reorder amount seems unrelated to the number 
of months since past purchase: it remains about $110 through the table. This is evidence 
that at least 6 and the transaction rate \ are independent. We also note in passing that 
the declining hazard rate and the C.D.F. reaching an asymptote below 1.0 are together 
further evidence of the dropout process as envisioned in SMC. 


Predictive Validity: Aggregate Reorder Patterns 


The predictive validity of the simple SMC model is illustrated in Figure 4. For the 
same customers as in Table 7, we estimate the SMC model on the first 12 months of 
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0.1 


5 25 


16 
Months Since initial Order 


FIGURE 4. Predicting Incremental Orders. 


reorder data after initial trial, and then predicted the incremental number of reorders in 
each of months 12-30. The slow decline in the reorder level shows up in both the model 
predictions and the empirical results. We next validate the separate customer-by-customer 
forecasts, in two ways. First, we use a telephone interview to see if customers predicted 
to still be active (or inactive) by the model in fact see themselves that way. Second, we 
compare predicted versus actual numbers of reorders, and dollar volume of reorders, for 
individual customers during a future time period. 


Convergence of the Model with Survey Seif-report 


One of the objectives of the research was to empirically compare the predictions of 
the SMC model as to the likelihood of a customer being currently active with the cus- 
tomer’s self-report of its own activity status. To accomplish this, telephone interviews 
were attempted with a sample of 400 of the 5,030 customers analyzed earlier. Up to three 
attempts were made to contact each sampled customer. The interview documented the 
disposition of the customer with respect to whether it was still active, had gone out-of- 
business, purchased supplies from a competing company, and so forth. For our purposes, 
the outcome measure from each interview is essentially a binary indicator: either the 
customer views itself as still active for this supplier (and hence expects to reorder at some 
unspecified time in the future) or has become inactive, with no expectation of any future 
reorders. Our interest is in the association between this self-report measure and the SMC 
model’s probability that the customer is still active. Presumably, a high SMC model 
probability should be accompanied by a higher propensity for the customer to report an 
active status. 

Of course, neither of these quantities is a perfect indicator of future customer behavior. 
This is so by definition for the (probabilistic) SMC model. And we know that customer 
intentions are imperfect predictors of future behavior, i.e., in general not all customers 
who say they will “definitely buy” will in fact do so (Morwitz and Schmittlein 1992). 
Thus, relative to future behavior the model and self-report activity status each contain 
an idiosyncratic error, and will not be perfectly associated with each other. As a result, 
we would expect the relation between the model’s probability and the proportion who 
self-report an active status to show a positive correlation, but a slope less than one. 

This sort of convergent validity is what we find in the results reported in Figure 5. The 
possible SMC model probabilities were split into deciles, and 40 customers with a model- 
based probability in each decile were chosen to contact by telephone. The proportion of 
respondents who report an active status is plotted for each decile. For instance, 12.5% 
of those customers predicted to have an activity probability between 0.1 and 0.2 actually 
said they were still active. That percentage self-reporting activity increased to 25% for 
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Actual Fitted 
ost Proportion Proportion 
Survey 
Self-Report 
Proportion 
of Customers 
Still Active 


0 4 4 1 1 1 — 1 i 


O41 $42 2.30.34 45 §&6 67 7-8 89 91 


SMC Mode! Probability that 
the Customer is Still Active 


FiGureE 5. Identifying Active Customers: Convergence Between Model Estimates and Survey-based Self- 
report. 


customers with model probabilities between 0.3 and 0.4. Both the raw proportions and 
the linear regression fit through them show the significant positive correlation and slope- 
less-than-one hypothesized above. (The regression R? was 0.81.) The substantial con- 
vergent validity here broadens our confidence both in the SMC model’s underlying process 
assumptions and in its predictions. 


Predictive Validity: Individual Customer Transactions 


Probably the best single indicator of the model’s effectiveness is its ability to predict 
the frequency and size (dollar volume) of future transactions for individual customers. 
Values for the seven extended SMC parameters (r, a, 5, 8, 0%, ov, E[6]) were taken to 
be the estimates reported earlier in the paper. Then, for a different set of customers over 
a later six-month period, actual reorders were compared to the extended SMC model 
predictions, those predictions being based on the model parameters and, for each customer, 
on the purchase history observed to date.° 

One quantity of interest is the number of reorders a customer placed during the val- 
idation period. The mean squared error (MSE) for the SMC model predictions was 
0.627. To interpret this number, we also computed predictions for a natural baseline 
model—namely, a simple extrapolation of that customer’s observed history. In other 
words, the customer’s observed reorder rate prior to the six-month validation period was 
used to predict reorder frequency during the latter period. The MSE for the baseline 
forecasts was 1.006, indicating a 38% improvement in squared error for the SMC model. 

We also validated the actual total dollar volume generated by each customer in the 
six-month period. Here, the baseline forecast was the customer’s observed historical reor- 
der rate (on a six-month basis) multiplied by the average dollar volume per reorder 
exhibited by the customer to date. The baseline forecast MSE was 19,173 while the MSE 
for the extended SMC model was 12,884, an improvement of 33% for our model. In 
short, all of the evidence in this section indicates that the extended SMC model captures 
the main qualitative characteristics of the customer reorder timing/ reorder size/retention 
process, and also predicts future customer behavior relatively well. 


8. Discussion and Conclusions 


We have focused on validating and extending the SMC model for customer base anal- 
ysis. We hope that the family of empirical results presented convinces the reader that the 


5 A subset of customer group H from Table 1 was used, comprised of all who had placed an initial order on / 
after May 1986 and before November 1988. Predictions were made for the six-month period December 1988- 
May 1989. A total of 1,606 customers met the criteria, and therefore each had between 0 and 30 months of 
observed purchase history, to serve as the basis for the six-month prediction. 
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expanded model is useful for aggregate customer forecasts, customer segmentation, in- 
dividual customer valuation, and inferences about the reorder /dropout process actually 
taking place among customers. We have shown how exogenous factors affecting this 
process can be incorporated in the expanded SMC model. The empirical results provide 
answers to the key managerial questions raised earlier in the paper. 

We think that the application here convincingly indicates that customer dropout can 
be a real and significant phenomenon and that it can be signaled account-by-account in 
the pattern of past orders. It is hoped that this paper will help dispel the notion that 
industrial purchase patterns are not amenable to significant modeling efforts of the general 
sort that are routine in researching consumer markets. Of course, industrial markets are 
somewhat different and require analyses sensitive to these differences. This is reflected 
in our current work in two ways: the focus on dollar volume and the use of purchase 
data only for the firm in question (i.e., ignoring orders placed with competitive suppliers 
that the firm will not often be aware of in any systematic way).° 


6 This paper was received July 16, 1991, and has been with the authors 13 months for 2 revisions. Processed 
by Scott A. Neslin, Area Editor. 


Appendix Al 


The Probability that a Given Customer is Still Active 


Case 1: a > 6. The probability that a customer is still active, given an observed history of X purchases in 
time (0, 7) since trial, with the most recent purchase at time ¢, is 


7 7 s a + T\**/B + Ts — 
PlActivelr, a, 5,8,.X=x,0, T= {1+ 5 |(22 7) (EAT) Fea, buss 2000) 


Ss -1 
- (FEF) Fea, bi en 279)]} (Al) 


where a) =r+x+s;b,=s+ looq=r+x+s54+152,(y) =(a- B)/(aty). 
Case 2: a < 6. 


. _ _ Ss at T\'**/B+T\5 oo 
PlActivelr, a, 5, 8,.X= x, 7)= {1+ || ( TT) Faas bai en 200 


+ T\"tx ~t 
-(FSa) Fan dies zcry] |” (ay 
where 22 =r+x+s; b= r+ x;Q=rt+x+st+1;2(y)=(B—-—a)/(B+y). 
Case 3: a = B. 
S at T\tstx ~l 
P[ Acti = B,X= = 41 +—— - ; 
[Active|r, s, a = 6,X = x,t, T] {! l(a i}} (A3) 


In (AI) and (A3), F(a, 6; c; z) is the Gauss hypergeometric function (Abramowitz and Stegun 1972, p. 
588). It can be computed using either numerical integration or the algorithms in Luke (1977). When no 
transactions are observed in the period (0, 7’), the probability that the customer is still active is obtained by 
substituting ¥ = 0 and ¢ = 0 in (A1), (A2), or (A3). 


Appendix A2 


A Two-step Estimation Method for the SMC Model 


As in SMC, we choose parameter values to best fit the observed number of purchases in a period of duration 
T predicted by Equation (5). Note that (5) only depends on three of our four parameters (7/a, 5, 8) fora 
period of any duration T. For T = 1, 2,..., then, we have an observed number of purchases per customer and 
a prediction via (5). We choose (r/a, s, 8) to minimize the sum (over 7°) of squared discrepancies in these 
predictions. Given estimates for these three parameters, we obtain a separate estimate of a (and hence r) by 
fitting the variance in purchases predicted by Equation (6). As for (5), we minimize (over a, given r/a, 5, 8) 
the sum of squared errors over T = 1, 2,.... 
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Of course, a customer can only provide an observation for the left side of (5) or (6) if the initial purchase 
was made at least T time periods ago. (That is, a customer “in the system” for only three months cannot provide 
an estimate of average purchase frequency over a six-month period after trial.) Thus, we will choose some cutoff 
length of time, 7;, to screen customers for inclusion in the estimation algorithm: only those customers having 
been in the system for at least time 74, will be used. 

The estimation procedure can be summarized as: 

Step 1: Choose Ty. For all customers making an initial purchase at least 7), time units ago, choose (r/a, 
5, 8) to minimize 


Tu 
D (E{X|r/a, s, 8, T) ~ Xr(Tu))? (A4) 


T=1 


where X;(Ty,) is the average number of transactions per customer through those customers’ first 7 months. 
We use an iterated pattern search to minimize (A4). 


Step 2: Using Step 1 estimates 7/a, §, 8; for each T = 1,..., Ty, choose & to equate Var [X |r, a, s, B, 
T) with Var [X7({ Ty,)] where Var [X7(Ty,)] is the variance across customers (the same individuals as in Step 
1) in the number of transactions per account through those customers’ first T months after initial purchase. 
Average these estimates to obtain a: 


1 \ ™ 
a= (z,) > ar. (AS) 


For the main empirical findings of this paper, we use 7y, = 30 months. The estimation section in the text 
reports some evidence regarding sensitivity of (7, a, §, 8) to choice of Ty. 
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